RDF with regular expressions
Abstract
RDF is a knowledge representation language dedicated to the annotation of resources within the framework of the semantic web. Among the query languages for RDF knowledge bases, some, such as SPARQL, are based on the formal semantics of RDF and the concept of semantic consequence; others, inspired by work in databases, use regular expressions to search for paths in the graph associated with the knowledge base. To combine the expressivity of these two approaches, we define a mixed language, called PRDF (for "Paths RDF"), in which the arcs of a graph can be labeled by regular expressions. We define the syntax and semantics of these objects, and propose a correct and complete algorithm which, by a kind of homomorphism, calculates the semantic consequence between an RDF graph and a PRDF graph. This algorithm is the heart of query answering for PSPARQL, the extension of the SPARQL query language that we propose and have implemented: a PSPARQL query makes it possible to query an RDF knowledge base using graph patterns whose predicates are regular expressions.
Key-words: semantic web, query language, RDF, SPARQL, regular expressions.
INRIA research report, inria-00144922, version 4, 22 May 2007.
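To make the idea of arcs labeled by regular expressions concrete, the following is a minimal Python sketch of path matching over a set of triples. It is not the algorithm of the report: it ignores blank nodes, variables and RDF semantic consequence, and only computes which node pairs are linked by a path whose sequence of predicates matches a regular expression. The class names (`Pred`, `Seq`, `Alt`, `Plus`), the helper functions and the toy transport graph are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Set, Tuple, Union

Triple = Tuple[str, str, str]  # (subject, predicate, object)


@dataclass(frozen=True)
class Pred:
    """A single predicate, e.g. train."""
    name: str


@dataclass(frozen=True)
class Seq:
    """Concatenation: left followed by right."""
    left: "Expr"
    right: "Expr"


@dataclass(frozen=True)
class Alt:
    """Alternation: left or right."""
    left: "Expr"
    right: "Expr"


@dataclass(frozen=True)
class Plus:
    """One or more repetitions of inner."""
    inner: "Expr"


Expr = Union[Pred, Seq, Alt, Plus]


def compose(a: Set[Tuple[str, str]], b: Set[Tuple[str, str]]) -> Set[Tuple[str, str]]:
    """Relational composition: (s, o) such that s -a-> m -b-> o for some m."""
    return {(s, o) for (s, m) in a for (m2, o) in b if m == m2}


def pairs(expr: Expr, graph: Set[Triple]) -> Set[Tuple[str, str]]:
    """Return all (start, end) pairs linked by a path matching expr."""
    if isinstance(expr, Pred):
        return {(s, o) for (s, p, o) in graph if p == expr.name}
    if isinstance(expr, Alt):
        return pairs(expr.left, graph) | pairs(expr.right, graph)
    if isinstance(expr, Seq):
        return compose(pairs(expr.left, graph), pairs(expr.right, graph))
    if isinstance(expr, Plus):
        base = pairs(expr.inner, graph)
        closure = set(base)
        while True:  # iterate to a fixpoint (transitive closure)
            step = compose(closure, base)
            if step <= closure:
                return closure
            closure |= step
    raise TypeError(f"unknown expression: {expr!r}")


def reachable_from(start: str, expr: Expr, graph: Set[Triple]) -> Set[str]:
    """Nodes reachable from start along a path matching expr."""
    return {o for (s, o) in pairs(expr, graph) if s == start}


# Hypothetical toy graph: cities linked by means of transport.
GRAPH: Set[Triple] = {
    ("Paris", "train", "Lyon"),
    ("Lyon", "train", "Grenoble"),
    ("Grenoble", "plane", "Rome"),
    ("Rome", "bus", "Naples"),
}

# The path expression (train | plane)+
TRAVEL = Plus(Alt(Pred("train"), Pred("plane")))

print(sorted(reachable_from("Paris", TRAVEL, GRAPH)))
```

In this toy graph, Naples is not reachable from Paris under `(train | plane)+` because the last leg uses `bus`. A pattern of this kind cannot be written as a fixed-length SPARQL basic graph pattern, which is the gap that regular-expression predicates are meant to fill.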
Similar resources
Towards an RDF Validation Language Based on Regular Expression Derivatives
There is a growing interest in the validation of RDF based solutions where one can express the topology of an RDF graph using some schema language that can check if RDF documents comply with it. Shape Expressions have been proposed as a simple, intuitive language that can be used to describe expected graph patterns and to validate RDF graphs against those patterns. The syntax and semantics of S...
nSPARQL: A Navigational Language for RDF
Navigational features have been largely recognized as fundamental for graph database query languages. This fact has motivated several authors to propose RDF query languages with navigational capabilities. In particular, we have argued in a previous paper that nested regular expressions are appropriate to navigate RDF data, and we have proposed the nSPARQL query language for RDF, that uses neste...
nSPARQL: A Navigational Language for RDF
Navigational features have been largely recognized as fundamental for graph database query languages. This fact has motivated several authors to propose RDF query languages with navigational capabilities. In this paper, we propose the query language nSPARQL that uses nested regular expressions to navigate RDF data. We study some of the fundamental properties of nSPARQL and nested regular expres...
Validating and Describing Linked Data Portals using RDF Shape Expressions
In order to improve the quality of linked data portals, it is necessary to have a tool that can automatically describe and validate the RDF triples exposed. RDF Shape Expressions have been proposed as a language based on Regular Expressions that can describe and validate the structure of RDF graphs. In this paper we describe the WebIndex, a medium sized linked data portal, and how we have emplo...
The Perfect Match: RPL and RDF Rule Languages
Path query languages have been previously shown to complement RDF rule languages in a natural way and have been used as a means to implement the RDFS derivation rules. RPL is a novel path query language specifically designed to be incorporated with RDF rules and comes in three flavors: node-, edge- and path-flavored expressions allow expressing conditional regular expressions over the nodes, edge...